High quality multi-pulse based CELP speech coding at 6.4 kb/s and its subjective evaluation
نویسندگان
چکیده
This paper proposes an MP-CELP (Multi-Pulse-based CELP) speech coding at 6.4 kb/s with 10 ms frame. In MP-CELP, amplitudes or signs of multi-pulse excitation are simultaneously vector quantized (VQ). A combination search between multiple pulse location candidates and VQ codebook remarkably improves the quantization performance. In order to improve speech quality for background noise conditions, an adaptive pulse location restriction method is developed. The subjective evaluation results show that speech quality for 6.4 kb/s MP-CELP is higher than that for G.726 at 32 kb/s and is equivalent to that for 6.3 kb/s G.723.1 with 30 ms frame in clean speech and tandem conditions. For background noise conditions, the adaptive pulse location restriction significantly improves MOS value by 0.9. The speech quality is equivalent to that for G.723.1, but still does not reach to that of 24 kb/s G.726, except interference talker condition.
منابع مشابه
4 kb/s multi-pulse based CELP speech coding using excitation switching
Thispaper proposes an MP-CELP (Multi-Pulse-based CELP) speech coding at 4 kb/s. In MP-CELP, amplitudes or signs of multi-pulse excitation are simultaneously vector quantized (VQ). In order to improve speech quality for background noise conditions, excitation signal is switched between voiced and unvoiced speech, and the number of pulse is greatly increased for unvoiced speech by restricting pul...
متن کاملHybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s
This paper describes extensions of the 4 kb/s hybrid MELP/CELP coder, up to 6.4 kb/s and down to 2.4 kb/s. The baseline 4 kb/s coder uses three coding modes: MELP in strongly voiced speech frames, CELP with pitch prediction in weakly voiced frames, and CELP with stochastic excitation in unvoiced frames. To minimize switching artifacts between parametric MELP and waveform CELP coding, an alignme...
متن کاملUsing Various Types of Excitation Signals
A high-qulaity speech coding method (SPMEX) at 4.8 kb/s is proposed. The SPMEX selects a suitable excitation signal, based on the decision from aconstic features of speech signal in a frame. lmproved pitch interpolation multi-pulse (PMPC) excitation is selected for vowel-like speech. In PMPC, multi-pulse during only one pitch period is calculated in the frame. Fnrther, gain and phase adjusting ...
متن کاملAnalysis by synthesis speech coding with generalized pitch prediction
A new analysis-by-synthesis speech coding structure is presented for high-quality speech coding in the 4 to 8 kb/s range. CELP with generalized pitch prediction (GPP-CELP) di ers from classical code-excited linear prediction (CELP) in that for voiced segments it is the speech signal that is decomposed into a component predictable with the aid of the adaptive codebook (ACB) and a nonpredictable ...
متن کاملHigh quality MELP coding at bit-rates around 4 kb/s
Recently, a number of coding techniques have been reported to achieve near toll quality synthesized speech at bit-rates around 4 kb/s. These include variants of Code Excited Linear Prediction (CELP), Sinusoidal Transform Coding (STC) and Multi-Band Excitation (MBE). While CELP has been an effective technique for bit-rates above 6 kb/s, STC, MBE, Waveform Interpolation (WI) and Mixed Excitation ...
متن کامل